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Abstract 

Let Q be a family of n-vertex graphs of uniform degree 2 with the property that 
the union of any two member graphs has degree four. We determine the leading 
term in the asymptotics of the largest cardinality of such a family. Several analogous 
problems are discussed. 
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1 Introduction 



Let J- and T> be two disjoint families of graphs on the same vertex set [n] = {1,2,..., n}. 
We denote by M{T , V, n) the largest cardinality of a subfamily Q C J 7 with the property 
that the union of any two of its different members belongs to T>. Here the union of two 
graphs on the same vertex set is the graph whose edge set is the union of those of the 
two graphs. We are especially interested in the cases when T> is monotone in the sense 
that if a graph is in V> then any graph containing it also belongs to T>. (In our context, 
a graph contains an other one if they have the same vertex set and their edge sets are in 
this relation). This framework was introduced in [9]. It represents an attempt to describe 
a consistent part of extremal combinatorics where information theoretic methods seem 
to be relevant. When J 7 and T> are complementary, then it is clear that without loss 
of generality, one can suppose that at least one Q of maximum cardinality consists of 
maximal elements of J 7 . This is true because if we replace a member of Q by any graph 
containing it, the union condition remains satisfied by the monotonicity of T>. Further, if 
we do this for all the non-maximal member graphs, the family so obtained will have the 
same cardinality as the original family, since no two members of Q can be contained in a 
same member of J 7 , for else their union would still be in J 7 , which is impossible by the 
union condition. Therefore, in this Q of maximum cardinality consists of all the 

maximal elements of T . Thus if J 7 and V> are complementary, our problem reduces to 
counting the maximal elements of the graph family J 7 . Many such enumeration problems 
have been studied. In several recent papers of this kind information-theoretic methods 
are used, cf. e. g. [7] dating back such enumeration problems to Dedekind [5] or the 
often rediscovered Kahn-Lovasz theorem [I]. If, however, J 7 and T> are far from being 
complementary, the problems in our framework resemble the graph capacity problem of 
Claude Shannon [T2] . 

There are many situations in mathematics where we are given a set of which we have 
to choose the maximum number of elements any two of which are "more distant" than a 
given threshold. In most of these problems distance is measured in terms of a metric. A 
classical example is the code distance problem in information theory [UJ , where in a set of 
binary sequences of some fixed length we are looking for the largest subset of points any 
two of which are at Hamming distance at least d. However, in Shannon's graph capacity 
problem the elements have to be distant in a structural instead of a metric sense. For 
various generalizations and applications of Shannon's problem we refer the reader to the 
survey article [TU] and the more recent paper pQ . In the first generalizations of Shannon's 
problem one considers a set of "distant" sequences from a finite set, or particular sequences 
from an infinite set, namely, permutations. Permutations led the authors of [9] to extend 
the problem of graph capacity to the search for "distant" Hamilton paths, a rather natural 
representation of permutations. However, the graph representation naturally leads to 
graph-theoretic concepts of diversity and hence problems of an altogether different kind. 
One of the main objectives in this series of papers has been a (so far not very successful) 
search for common patterns in the optimal constructions. 
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In this paper we are studying the growth rate of M(T, V, n) in n in cases of graph 
families defined in terms of degree conditions on the member graphs. In the last section 
we introduce a new graph invariant that generalizes the concept of degree doubling in a 
different direction. 

Note that logarithms and exponentials are to the base 2. 

2 Main result 

In our simplest problem J 7 is the family of connected graphs of uniform degree 2 while T> 
is the family of all graphs of (maximum) degree at least 4 on the common vertex set [n]. 
In other words, the graphs in J 7 are the Hamilton cycles with vertex set [n]. Let us write 
Q(n) = Mf^J 7 , V, n). We will show that, essentially, Q(n) grows like the square root of n\. 
More precisely, we have 

Theorem 1 

(ra-1)! n\ 
2- K2j!(l + v / 2) n " W " [n/2\\2l^\ 

Proof. 

We start by proving the upper bound. To this end, suppose for a moment that n is 
even and let P be a perfect matching, i.e., a graph of uniform degree 1, with vertex set [n\. 
Let further C = C(P) be the family of all Hamilton cycles containing P as a subgraph. 
We claim that the union of any two graphs in C has maximum degree strictly less than 4. 
As a matter of fact, if the union of two graphs has degree 4, then for at least one vertex 
x G [n] the union has degree 4, meaning that the sets of its incident edges in the two 
graphs must be disjoint. However, since the two cycles contain a common edge incident 
to x, the one in the perfect matching P, we have a contradiction. This means that if Q is 
a family of Hamilton cycles in which the union of any two members has degree 4, then Q 
can contain at most one cycle from C. We claim that 

|c| _ {nim nl \ (1) 

To verify this claim note that any linear order of the edges in P combined with any given 
orientation of the edges of P defines an oriented Hamilton path for the vertex set [n]. The 
first vertex of this path is the starting point of the first edge in the linear order, whence 
the path goes to the endpoint of this edge. From here the path continues to the first 
point of the second edge of P, and so on. Since there are (n/2)! orders of the edges in P 
each of which having 2 ra//2 orientations of these edges, we see that the number of oriented 
Hamilton paths on [n] containing a fixed perfect matching is (n/2)!2 n//2 . Every oriented 
Hamilton path gives rise to an oriented Hamilton cycle in the obvious manner, making its 
last vertex adjacent to the first one. We obtain each oriented Hamilton cycle exactly n/2 
times in this manner. Further, each Hamilton cycle so generated will appear with both 
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of its orientations. In conclusion, every Hamilton cycle containing P is obtained n times 
which gives ([T]). 

It is obvious, by symmetry, that every Hamilton cycle is contained in C(P) for the 
same number of perfect matchings P of [n]. Thus, by double counting, considering that 
the total number of Hamilton cycles is ^"~^ ! , we obtain that 



\G\ < 



n/2)!2™/ 2 ' 



Hence, for n even, 



where for the last inequality we use the obvious bound ( n ™ 2 ) < 2 n . 

The case of an odd n is similar. We fix a perfect near-matching P* by which we mean 
a graph of uniform degree 1 with n — 3 vertices and a path connecting the remaining 3 
vertices. As before, we consider the set C(P*) of all the Hamilton cycles containing this 
P*. As in the case of n even, at most one of these cycles can be in any family Q satisfying 
our condition on the degree of pairwise graph unions. The number of Hamilton cycles 
containing our P* is now 

(L^/2J)!2L-/ 2 J 



\C{P* 

Just like in the previous case, we get 



(n-1) 



Q(n) < 



nl 



(Ln/2J)!2L«/2J 



as stated in the theorem. For convenience, we note the following somewhat weaker but 
nicer form of the bound: 

Q(n) < (\n/2])\2^ (2) 

for every n. 

Let us now turn to lower bounding Q(n). To this end, we will use a greedy algorithm 
to exhibit a large enough family of Hamilton cycles with the required properties. At each 
step in the algorithm we choose an arbitrary Hamilton cycle and eliminate from the choice 
space all those incompatible with the chosen one. This procedure goes on until the choice 
space becomes empty. 

To analyze this algorithm, we need an upper bound on the number of the cycles 
incompatible with a fixed Hamilton cycle H. Clearly, a cycle C is incompatible with H 
if and only if the set of their common edges covers all the vertices in [n]. Every such 
covering contains a minimal covering. In a minimal covering the edges are partitioned 
into single edges and paths of two edges. These are the connected components of the 
underlying graph. Obviously, the same covering may contain several minimal coverings. 
Let us fix a minimal covering and let s be the number of its connected components. Then 
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[fl < s < [ n /2j and the number of adjacent edge pairs in the covering is n — 2s. In 
consequence, as in the first part of the proof, we see that the number of Hamilton cycles 
whose intersection with H contains our fixed minimal covering is 

2 s (s- 1)! 

while the number of minimal coverings with s connected components is 

s 

n — 2s 

We conclude that the number of Hamilton cycles that are incompatible with a fixed one 
is upper bounded by 

LfJ 



Notice that 



E L D ! - < 3 > 



K2J , , / [n/2j , \ \ 



Further, rewriting ( n " 2s )2 s = ( 2 ")(V / 2) 2s we see that the right-hand side of our last 
inequality can be further bounded by 



EQnV) Ln/2j! = (l + v^rin/2j!- 



This means that the greedy algorithm will eliminate at most (1 + \^2) n [n/2\\ cycles at 
each step, yielding a cycle family Q with the desired union property and containing at 
least 

(w- 1)! 
2(l + v / 2) n K2j! 
cycles, as claimed for the lower bound. 

□ 

Next we turn to the general case of graphs of uniform degree 2. Let therefore J 7 be 
the family of graphs of constant degree 2, while as before, V is the family of graphs of 
maximum degree 4, on the same vertex set [n\. We denote 

R(n) = M(F,V,n). 

Obviously, 

Q{n) < R(n). 

However, as we shall see, R{n) grows substantially faster than Q(n). As our next 
statement shows, the family of all those graphs whose connected components are triangles 
is essentially optimal. 
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Theorem 2 

c ^ < Bin) < H ' 

where the c in the lower bound is an absolute constant. This constant is 1 if n is a multiple 
of 3. 

Proof. 

Let us begin by establishing the lower bound. For simplicity, let us suppose for the 
time being that n is a multiple of 3. Let F and G be two different graphs with vertex set 
[n] both of which have only triangles as connected components. We claim that their union 
contains at least one vertex of degree 4. In fact, suppose that this is not the case. As we 
have already established, if the union of two graphs of uniform degree 2 does not have 
vertices of degree 4, then this intersection has no isolated points. This then implies that 
the intersection contains at least two edges of every triangle of both graphs. However, 
two edges of a triangle define that triangle. In other words, each triangle of F coincides 
with some triangle of G, and this means that the two graphs coincide; a contradiction. 
As it is easily seen that if n is a multiple of 3, then the number of those graphs on vertex 
set [n] whose connected components are triangles, is 

n\ 



(n/3)! 6™/ 3 ' 

This establishes our lower bound if 3 divides n. In the opposite case write n = 3q + r 
where q, r are integers with 3 < r < 6 and consider those graphs that contain q connected 
components on [3q] and which coincide on the fixed set [n] — [3q] of at most 5 vertices. 
Our previous argument applies on [3q]. 

To prove an almost matching upper bound, let us choose an arbitrary partition p of 
n, i. e., a sequence of non- necessarily distinct natural integers n,, i — 1, 2, . . . , t such that 
Yll=i n i = n - The sequence obtained by any permutation of the indices i is considered an 
other representation of the same partition p. Let us further denote by .F(p) the family of 
those graphs of uniform degree 2 on the vertex set [n] that have t connected components 
with vertex sets of cardinality n,, i — 1, 2, . . . , t. We claim that 

To verify this claim, let k = k(p) be the number of odd integers among the rij in p. 
Consider a graph P(p) with vertex set [n] and having k connected components that are 
3- vertex paths, while the rest are single edges. Then n—3k is even and, by its construction, 
P(p) has no isolated vertices. This implies that the subfamily C(p) of those graphs in 
F(p) which contain P(p) as a subgraph has no two member graphs with a union of degree 
4. Further, by symmetry, we see that constructing such subfamilies for all the different 
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copies of -P(p) on the vertex set [n], we obtain a uniform covering of F(p). This yields, 
as in the proof of the upper bound part of Theorem [H 

M(J-(p),P,n)<^M. (5 ) 

Hence, in order to obtain a proof of we will upper bound the cardinality of .F(p) and 
lower bound that of C(p). It is easy to see that 

, . . n\ n\ 



«2*ir*=in*-«lti"i' 
On the other hand, let I be such that 3k + 2Z = n. Then 

|C(P)I > -£±«U 

n ■ t\ ili=i n< 

Substituting the bounds from the last two inequalities into (jSJ) we obtain 

M(-F(P),XV»)<^ 

Observe that 

, , , n — 3k n — k . 

k + l = k^ = > n/3 

2 2 ~ 1 

where the last inequality follows from the obvious relation k < n/3 which allows us to 
conclude that 

M(T(p),V,n)<^. (6) 



On the other hand, we obviously have 

M(^2?,n)<£M(^(p),D,n), 



p 

where p runs over the partitions of n. We know from the seminal paper of Hardy and 
Ramanujan [6] that the number of the partitions of n is less than — -. Using this estimate 
in (|SJ) brings us to our upper bound 



M{F,V,n) < e^- 



nl 



□ 

We conclude this section by a slight variant of the problem about Hamilton cycles. 
In fact, we ask the same problem for Hamilton paths. Hamilton paths (in the oriented 
case) are a natural representation of permutations and the present problem area grew out 
of the problem of permutation capacity [8]. For the next result let T H be the set of all 
(non-oriented) Hamilton paths on the vertex set [n]. We will show that up to a linear 
constant the largest cardinality of a set of Hamilton paths with the property that the 
union of any two has degree 4 is the same as the analogous quantity for Hamilton cycles. 
More precisely, we have 
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Theorem 3 

M(F, V, n) > M(T H , V, n) > — ^— • M(F, V, n). 

n — 1 

Proof. 

The first inequality follows from our initial observation about maximal elements of a 
graph family. In fact, Hamilton cycles are maximal elements in the family of connected 
graphs of degree 2 on [n]. 

In order to prove the second inequality, let C be an optimal family of Hamilton cycles, 
hence \C\ = M(.F, V,n). Let further, for any pair of distinct vertices {a, b} 6 the 
family C(a,b) consist of those cycles from C that contain the edge {a, b}. Notice that if 
we drop the edge {a, b} from each of the cycles in C(a,b), the resulting Hamilton paths 
satisfy our condition, implying that \C(a,b)\ < M{F n ,T> ', n). We have 

n-M(J r ,V,n) = n-\C\= ^ l C ( a > b )\ ^ ( \ ) M {? H , V, n), 

{a,6}e[n] ^ ' 

completing the proof. 

□ 

3 Degree doubling and graph distinguishability 

Our previous problem on degree doubling is a special case of the following. Let G be an 
arbitrary finite simple graph on n vertices. Without loss of generality we suppose that 
its vertex set is [n]. Let F be a different graph on [n] but isomorphic to G. We will 
say that F and G are Shannon-distinguishable if there exists a vertex x G [n] such that 
its neighborhoods in the two graphs are disjoint. Let us denote by v(G) the maximum 
number of pairwise Shannon-distinguishable copies of G. If G is a Hamilton cycle on 
[n] then v(G) — Q(n). If G is a digraph, then we can replace neigborhood with out- 
neighborhood in the definition. (The out-neigborhood of a vertex is the set of those 
vertices which are the endpoints of edges starting in the vertex). Clearly, v{G) is 1 if 
there are no two vertices in G with disjoint neighborhoods. In case of cycle graphs the 
determination of v{G) is very close to the old question of permutation capacity of [S]. 

More importantly, Shannon's classical problem of graph capacity has a natural formu- 
lation in these terms. Graph capacity corresponds to the highest rate at which information 
can be transmitted over a discrete memoryless stationary channel in an error-free manner. 
In Shannon's information theory a channel is modeled by a stochastic matrix W . The 
rows of the matrix are indexed by the elements of a finite alphabet X and the columns 
by those of a finite alphabet 3^ The element at the crossing of the row of index x and the 
column y, traditionally denoted by W{y\x) is the probability that the transmission of the 
symbol x results in the reception of the symbol y. The repeated use of the channel is char- 
acterized by a similar transmission matrix, denoted W m . If m symbols are transmitted 
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consecutively, then the transmission of the sequence x G X m of input symbols results in 
the reception of y G y m with probability W m (y|x) = YliLi W(yi\xi) where Xi is the z'th 
coordinate of the sequence x and is the z'th coordinate of y. A code C G X m of length 
m is a set of input sequences from X m no two of which can result in the same output 
sequence with positive probability. The rate of C is the m'th root of its cardinality \C\. 
(More precisely, in the information theoretic literature the rate is the binary logarithm of 
this quantity). The supremum of all the code rates, C{W) is the zero-error capacity of 
the channel. We will say that a code is time-symmetric if the codewords are obtainable 
from one another by a suitable permutation of the coordinates. It is well-known and easy 
to see that the supremum of the rates of time-symmetric codes achieves capacity, [2], 
Chapter 11. (Note that in the information-theoretic literature such codes are called fixed 
composition codes). 

Shannon [12] observed that the determination of zero-error capacity can be formulated 
in graph theory. For this purpose one can define a graph Gw with vertex set X in which 
two vertices are adjacent if the corresponding input symbols cannot result in the same 
output symbol with positive probability for both. Correspondingly, one can define the 
graph G w = Gw m - Then the m'root of the largest cardinality of a clique in Gw m is the 
largest rate of a zero-error code for m uses of the channel. The supremum in m of all 
these rates is C(Gw), the capacity of the graph Gw We associate with the matrix W a 
directed graph G = G{W) with vertex set X U y. We draw an edge from a G X U y to 
b G X U y if either W(6|a) > or a G y. In the same manner we have a digraph G m 
for every m and the matrix W m . Given a probability distribution P on X we denote by 
G m (P) the possibly empty graph induced by G m on the set of those vertices in X m in 
which every element a G X appears mP(a) times in the coordinates of the sequences. We 
denote by V(X) the set of all probability distributions on the set X. In conclusion we 
have 

Proposition 1 

C(G) = sup sup ^/u(G m (P)). 

Proof. 

It is sufficient to note that to any two sequences x and y in the vertex set of G m (P) 
there is a permutation of the coordinates of x that transforms x in y and leaves invariant 
the graph G m (P). 

□ 

4 Related open problems 

It seems interesting to ask what happens with our original problem if we replace maximum 
degree with average degree. More precisely, let J 7 be as before, the family of all Hamilton 
cycles on [n]. Let T> a be the family of all graphs with average degree at least a for some 
a > 2. How does M(J-, V a , n) depend on a? 
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